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Important notice 
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perceived difference in contents between such versions, the reference version is the Portable Document Format (PDF). 

In case of dispute, the reference shall be the printing on ETSI printers of the PDF version kept on a specific network drive 

within ETSI Secretariat. 

Users of the present document should be aware that the document may be subject to revision or change of status. 

Information on the current status of this and other ETSI documents is available at 

http://portal.etsi.orq/tb/status/status.asp 

If you find errors in the present document, send your comment to: 
editor@etsi.fr 

Copyright Notification 

No part may be reproduced except as authorized by written permission. 
The copyright and the foregoing restriction extend to reproduction in all media. 

© European Telecommunications Standards Institute 2002. 
All rights reserved. 
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Intellectual Property Rights 



IPRs essential or potentially essential to the present document may have been declared to ETSI. The information 
pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found 
in ETSI SR 000 314: "Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in 
respect of ETSI standards", which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web 
server ( http://webapp.etsi.org/IPR/home.asp ). 

Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee 
can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web 
server) which are, or may be, or may become, essential to the present document. 



Foreword 

This Technical Specification (TS) has been produced by ETSI 3rd Generation Partnership Project (3GPP). 

The present document may refer to technical specifications or reports using their 3GPP identities, UMTS identities or 
GSM identities. These should be interpreted as being references to the corresponding ETSI deliverables. 

The cross reference between GSM, UMTS, 3GPP and ETSI identities can be found under www.etsi.org/kev . 



ETSI 



3GPP TS 26.204 version 5.0.0 Release 5 3 ETSI TS 126 204 V5.0.0 (2002-03) 



Contents 



Intellectual Property Rights 2 

Foreword 2 

Foreword 4 

1 Scope 5 

2 References 5 

3 Definitions and abbreviations 6 

3.1 Definitions 6 

3.2 Abbreviations 6 

4 C code structure 6 

4.1 Contents of the C source code 6 

4.2 Program execution 6 

4.3 Code hierarchy 7 

4.4 Variables, constants and tables 12 

4.4.1 Description of fixed tables used in the C-code 13 

4.4.2 Static variables used in the C-code 14 

5 Homing procedure 17 

6 File formats 18 

6.1 Speech file (encoder input/decoder output) 18 

6.2 Mode control file (encoder input) 18 

6.3 Parameter bitstream file (encoder output/decoder input) 18 

Annex A (informative): Change history 19 

History 20 



£75/ 



3GPP TS 26.204 version 5.0.0 Release 5 4 ETSI TS 126 204 V5.0.0 (2002-03) 



Foreword 



rd , 



This Technical Specification (TS) has been produced by the 3 Generation Partnership Project (3GPP). 

The contents of the present document are subject to continuing work within the TSG and may change following formal 
TSG approval. Should the TSG modify the contents of the present document, it will be re-released by the TSG with an 
identifying change of release date and an increase in version number as follows: 

Version x.y.z 

where: 

X the first digit: 

1 presented to TSG for information; 

2 presented to TSG for approval; 

3 or greater indicates TSG approved document under change control. 

y the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, 
updates, etc. 

z the third digit is incremented when editorial only changes have been incorporated in the document. 
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Scope 



The present document contains an electronic copy of the ANSI-C code for the Floating-point Adaptive Multi-Rate 
Wideband codec. This floating-point codec specification is mainly targeted to be used in multimedia applications or in 
packet-based applications. The bit-exact fixed-point ANSI-C code in 3GPP TS 26.173 remains the preferred 
implementation for all applications, but the floating-point codec may be used instead of the fixed-point codec when the 
implementation platform is better suited for a floating-point implementation. It has been verified that the fixed-point 
and floating-point codecs interoperate with each other without any artifacts. 

The floating-point ANSI-C code in the present document is the only standard conforming non-bit-exact implementation 
of the Adaptive Multi-Rate Wideband speech transcoder (3GPP TS 26.190 [2]), Voice Activity Detection 
(3GPP TS 26.194 [6]), comfort noise generation (3GPP TS 26.192 [4]), and source controlled rate operation 
(3GPP TS 26.193 [5]). The floating-point code also contains example solutions for substituting and muting of lost 
frames (3GPP TS 26.191 [3]). 

The fixed-point specification in 26.173 shall remain the only allowed implementation for the 3G AMR-WB speech 
service and the use of the floating-point codec is strictly limited to other services. 

The floating-point encoder in the present document is a non-bit-exact implementation of the fixed-point encoder 
producing quality indistinguishable from that of the fixed-point encoder. The decoder in the present document is 
functionally a bit-exact implementation of the fixed-point decoder, but the code has been optimized for speed and the 
standard fixed-point libraries are not used as such. 



References 



The following documents contain provisions which, through reference in this text, constitute provisions of the present 
document. 

• References are either specific (identified by date of publication, edition number, version number, etc.) or 
non-specific. 

• For a specific reference, subsequent revisions do not apply. 

• For a non-specific reference, the latest version applies. In the case of a reference to a 3GPP document (including 
a GSM document), a non-specific reference implicitly refers to the latest version of that document in the same 
Release as the present document. 

[1] 3GPP TS 26.174: "AMR speech codec, wideband; Test sequences". 

[2] 3GPP TS 26.190: "Mandatory Speech Codec speech processing functions AMR Wideband speech 

codec; Transcoding functions". 

[3] 3GPP TS 26.191: "AMR speech codec, wideband; Error concealment of lost frames". 

[4] 3GPP TS 26.192: "Mandatory Speech Codec speech processing functions AMR Wideband Speech 

Codec; Comfort noise aspects". 

[5] 3GPP TS 26.193: "AMR speech codec, wideband; Source controlled rate operation". 

[6] 3GPP TS 26.194: "Mandatory Speech Codec speech processing functions AMR Wideband speech 

codec; Voice Activity Detector (VAD)". 
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Definitions and abbreviations 



3.1 Definitions 

For the purposes of the present document, the terms and definitions given in TS 26.190 [2], TS 26.191 [3], TS 26.192 
[4], TS 26.193 [5] and TS 26.194 [6]. 

3.2 Abbreviations 

For the purposes of the present document, the following abbreviations apply: 

AMR-WB Adaptive Multi-Rate WideBand 

ANSI American National Standards Institute 

GSM Global System for Mobile communications 

I/O Input/Output 

RAM Random Access Memory 

ROM Read Only Memory 



C code structure 



This clause gives an overview of the structure of the bit-exact C code and provides an overview of the contents and 
organization of the C code attached to the present document. 

The C code has been verified on the following systems: 

IBM PC/AT compatible computers with Windows NT40 and Microsoft Visual C++ v.6.0 compiler. 

- IBM PC/AT compatible computers with Windows NT40 and Intel C/C-H- v.4.0 compiler. 

ANSI-C was selected as the programming language because portability was desirable. 

4.1 Contents of the C source code 

The C code distribution has all files in the root level. 

The distributed files with suffix "c" contain the source code and the files with suffix "h" are the header files. The ROM 
data is contained in "rom" files with suffix "c". 

Makefiles are provided for the platforms in which the C code has been verified (listed above). Once the software is 
installed, this directory will have a compiled version of encoder and decoder and all the object files. 

4.2 Program execution 

The Adaptive Multi-Rate Wideband codec is implemented in two programs: 

(encoder) speech encoder; 

(decoder) speech decoder. 
The programs should be called like: 

encoder [encoder options] <speech input file> <parameter file>; 

decoder <parameter file> <speech output file>. 

The speech files contain 16-bit linear encoded PCM speech samples and the parameter files contain encoded speech 
data and some additional flags. 



ETSI 



3GPP TS 26.204 version 5.0.0 Release 5 7 ETSI TS 126 204 V5.0.0 (2002-03) 

The encoder and decoder options will be explained by running the applications without input arguments. See the file 
readme.txt for more information on how to run the encoder and decoder programs. 



4.3 Code hierarchy 



Tables 1 and 2 are call graphs that show the functions used in the speech codec, including the functions of V AD, DTX, 
and comfort noise generation. 

Each column represents a call level and each cell a function. The functions contain calls to the functions in rightwards 
neighbouring cells. The time order in the call graphs is from the top downwards as the processing of a frame advances. 
All standard C functions: memcpyO, fwrite(), etc. have been omitted. The initialization of the static RAM (i.e. calling 
the _init functions) is also omitted. 
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Table 1 : Speech encoder call structure 



E_MAlN_encode 


E UTIL decim 12k8 


E_UTIL_down_samp | E_UTIL_interpol 




E_UTIL_decim_12k8 






E_UTIL_hp50_12k8 




E_UTIL_hp50_12k8 




E_UTIL_f_preemph 




E_DTX_vad 


E_DTX_filter_bank 


E DTX filters 




E_DTX_filter3 




E DTX level calculation 




E_DTX_decision 


E_DTX_noise_estimate_up 
date 


E_DTX_update_cntrl 


E_DTX_hangover_addition 




E_DTX_speech_estimate 






E_DTX_tx_handler 








E DTX reset 


E LPC isf init 




E_MAI N_parm_store 






E UTIL autocorr 






E_LPC_lag_wind 




E LPC lev dur 








E_LPC_a_isp_conversion 


E_LPC_chebyshev 






E_LPC_f_int_isp_find 


E_LPC_f_isp_a_conversi 
on 


E_LPC_f_isp_pol_get 




E_LPC_isp_isf_conversion 






E_GAIN_clip_isf_test 




E_LPC_a_weight 




E_UTIL_residu 




E_UTIL_deemph 




E_GAIN_lp_decim2 




E_GAIN_open_loop_search 








E_GAIN_olag_median 


E_GAIN_sort 




E_DTX_pitch_tone_detection 






E_GAIN_open_loop_search 






E_GAIN_olag_median 




E_DTX_pitch_tone_detection 




E_UTIL_residu 




E DTX buffer 








E_DTX_exe 


E DTX frame indices fin 
d 






E_DTX_isf_history_aver 




E_DTX_isf_q 


E_LPC_isf_sub_vq 




E LPC isf noise d 


E_LPC_f_isf_reorder 


E_DTX_dithering_control 






E UTIL random 




E_MAIN_reset 


E_GAIN_clip_init 




E DTX reset 




E_DTX_vad_reset 




E_LPC_isf_2s3s_quantise 


E_LPC_stage1 _isf_vq 




E_LPC_isf_sub_vq 




E_LPC_stage1_isf_vq 




E_LPC_isf_sub_vq 




E_LPC_isf_2s3s_decode 


E_LPC_isf_reorder 




E_LPC_isf_2s5s_quantise 


E_LPC_stage1 _isf_vq 






E_LPC_isf_sub_vq 




E LPC isf 2s5s decode 


E LPC isf reorder 




E_LPC_isf_isp_conversion 






E_LPC_int_isp_find 


E_LPC_isp_a_conversion 


E_LPC_isp_pol_get 


E UTIL 1 extract 


E_UTIL_mpy_32_16 


E UTIL 1 extract 




E_UTIL_mpy_32_16 




E UTIL residu 






E_DTX_buffer 




E_UTIL_residu 




E_UTIL_synthesis 




E_LPC_a_weight 




E UTIL residu 




E_UTIL_deempfi 




E_UTI L_f_preem pfi 
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E_LPC_a_weight 






E_UTIL_synthesis 




E UTIL residu 




E_LPC_a_weight 




E_UTIL_synthesis 




E_UTIL_deemph 




E_GAIN_closed_loop_search 


E GAIN norm corr 


E UTIL f convolve 




E_GAIN_norm_corr_inter 
polate 






E_GAIN_clip_test 








E_GAIN_adaptive_codebook_ 
excitation 






E_UTIL_convolve 




E_ACELP_xy1_corr 




E_ACELP_codebool<_target_u 
pdate 




E_UTIL_convolve 




E_ACELP_xy1_corr 




E_ACELP_codebool<_target_u 
pdate 




E_UTIL_f_preempli 




E_GAIN_f_pitcli_sharpening 




E_ACELP_xh_corr 




E ACELP 2t 








E_ACELP_4t 


E_ACELP_h_vec_corr1 






E ACELP h vec corr2 




E_ACELP_2pulse_search 




E_ACELP_quant_1 p_N1 




E_ACELP_quant_2p_2N1 




E_ACELP_quant_3p_3N1 


E_ACELP_quant_2p_2N1 




E_ACELP_quant_1p_N1 




E_ACELP_quant_4p_4N 


E_ACELP_quant_4p_4N1 


E ACELP quant 2p 2N 
1 


E_ACELP_quant_1 p_N1 




E_ACELP_quant_3p_3N1 




E_ACELP_quant_2p_2N1 




E_ACELP_quant_3p_3N1 




E_ACELP_quant_5p_5N 


E_ACELP_quant_3p_3N1 




E_ACELP_quant_2p_2N1 




E ACELP quant 6p 6N 
2 


E_ACELP_quant_5p_5N 




E_ACELP_quant_1p_N1 




E_ACELP_quant_4p_4N 




E_ACELP_quant_2p_2N1 




E_ACELP_quant_3p_3N1 




E_UTIL_preemph 






E_GAIN_pitch_sharpening 




E_ACELP_xy2_corr 




E_ACELP_gains_quantise 


E_UTIL_dot_product12 


E_UTIL_saturate_31 




E UTIL norm 1 




E_UTIL_normalized_inver 
SB sqrt 






E_UTIL_l_extract 




E UTIL saturate 




E_UTIL_mpy_32_16 




E_UTIL_log2_32 


E UTIL norm 1 




E_UTIL_normalized_log2 




E_UTIL_signal_up_scale 






E_UTIL_signal_down_scale 




E_GAIN_clip_pit_test 




E_UTIL_signal_down_scale 








E_GAI N_voice_factor 


E_UTIL_dot_product12 






E_UTIL_norm_l 




E_UTIL_norm_s 




E UTIL norm s 








E_UTIL_synthesis 




E_UTIL_enc_synthesis 


E_UTIL_synthesis 






E_UTIL_deemph 




E_UTIL_hp50_12k8 
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E_UTIL_random 


E_UTIL_hp400_12k8 


E_LPC_a_weight 


E_UTIL_synthesis 


E_UTIL_bp_6k_7k 


E_UTIL_bp_6k_7k 
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Table 2: Speech decoder call structure 



D_MAIN_decode 


D DTX rx handler 


D LPC isf noise d 


D LPC isf reorder 




D_DTX_exe 


D_DTX_cn_dithering 


D_UTIL_random 




D_UTIL_pow2 






D_UTIL_norm_l 




D_UTIL_random 




D_UTIL_dot_product12 


D UTIL norm 1 




D_UTIL_normalized_inver 
se_sqrt 






D_LPC_isf_isp_conversio 

n 








D_LPC_isp_a_conversion 


D_LPC_isp_pol_get 


D UTIL 1 extract 




D_UTIL_mpy_32_16 




D UTIL 1 extract 






D_UTIL_mpy_32_16 




D_UTIL_dec_synthesis 


D_UTIL_synthesis_32 




D_UTIL_deemph_32 


D_UTIL_saturate 




D_UTIL_hp50_12k8 


D UTIL 1 extract 




D_UTIL_oversamp_1 6k 


D_UTIL_up_samp 


D_UTIL_interpol | 


D_UTIL_random 






D_UTIL_signal_down_sca 

le 




D_UTIL_dot_product12 




D_UTIL_normalized_inver 
se_sqrt 




D_UTIL_hp400_12k8 


D UTIL 1 extract 




D_UTIL_norm_l 






D_LPC_isf_extrapolation 


D UTIL norm s 




D_UTIL_l_extract 




D_UTIL_mpy_32 




D_LPC_isf_isp_conversio 
n 




D_LPC_isp_a_conversion 






D_LPC_a_weight 




D_UTIL_synthesis 




D_LPC_a_weight 




D_UTIL_synthesis 




D_UTIL_bp_6k_7k 




D_UTIL_hp_7k 




D_MAIN_reset 


D_GAIN_init 




D GAIN lag concealmen 
t init 




D DTX reset 




D_LPC_isf_2s3s_decode 


D_LPC_isf_reorder 




D LPC isf 2s5s decode 


D LPC isf reorder 




D_LPC_isf_isp_conversio 
n 








D_LPC_int_isp_find 


D_LPC_isp_a_conversion 






D GAIN lag concealmen 

t 


D_GAIN_sort_lag 


D_GAIN_insert_lag 




D UTIL random 






D_GA 1 N_adapti ve_codeb 
ook excitation 








D UTIL random 






D_ACELP_decode_2t 








D_ACELP_decode_4t 


D ACELP decode 1p N 
1 






D_ACELP_add_pulse 




D ACELP decode 2p 2 
N1 




D ACELP decode 3p 3 
N1 


D ACELP decode 2p 2 
N1 




D ACELP decode 1p N 
1 




D ACELP decode 4p 4 
N 


D ACELP decode 4p 4 
N1 


D ACELP decode 2p 2 
Ml 


D ACELP decode 2p 2 
N1 


D ACELP decode 1p N 
1 
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D ACELP decode 3p 3 
N1 


D ACELP decode 2p 2 
N1 


D ACELP decode 5p 5 
N 


D ACELP decode 3p 3 
N1 


D ACELP decode 2p 2 
N1 


D ACELP decode 6p 6 
N_2 


D ACELP decode 5p 5 
N 


D ACELP decode 1p N 
1 


D ACELP decode 4p 4 
N 


D ACELP decode 2p 2 
N1 


D ACELP decode 3p 3 
N1 


D_UTIL_preemph 






D_GAIN_pitch_sharpenin 

g 




D_GAIN_decode 


D_UTIL_dot_product12 




D_UTIL_normalized_inver 
SB sqrt 




D_GAIN_median 




D UTIL 1 extract 




D_UTIL_pow2 




D_UTIL_mpy_32_16 




D_UTIL_log2 


D_UTIL_norm_l 


D_UTIL_normalized_log2 


D_UTIL_signal_up_scale 


D_UTIL_saturate 




D UTIL signal down sea 
le 






D_GAIN_find_voice_facto 
r 


D_UTIL_dot_product12 




D UTIL norm 1 




D_UTIL_norm_s 




D UTIL norm s 






D_UTIL_l_extract 




D_ACELP_phase_disper 




D_UTIL_mpy_32_16 




D UTIL 1 extract 




D_GAIN_adaptive_control 


D UTIL norm 1 




D_UTIL_inverse_sqrt 




D_UTIL_dec_synthesis 


D UTIL saturate 




D_UTIL_signal_down_sca 

le 






D_DTX_activity_update 


D_UTIL_log2 





4.4 



Variables, constants and tables 



The data types of variables and tables used in the floating-point implementation are signed integers in 2's complement 
representation, defined by: 

Words 8 bit variable 
UWordS 8 bit unsigned variable 

Wordl6 16 bit variable 
Wordl6 16 bit unsigned variable 
Word32 32 bit variable 

Floating-point numbers use the IEEE (Institute of Electrical and Electronics Engineers) format: 

Float32 8 bit exponent, 23 bit mantissa, 1 bit sign 
Float64 1 1 bit exponent, 52 bit mantissa, 1 bit sign 
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4.4.1 Description of fixed tables used in tine C-code 

This clause contains a listing of all fixed tables declared in enc_rom.c and dec_rom.c files. 

Table 3: Encoder fixed tables 



Format 


Table name 


Size 


Description 


Word 16 


E ROM cdown unusable 


7 


Attenuation factors for codebook gain in lost frames 


Word 16 


E ROM cdown usable 


7 


Attenuation factors for codebook gain in bad frames 


Float32. 


E_ROM_corrweight 


199 


Weighting of the correlation function in open loop LIP search 


Word 16 


E ROM cos 


129 


Table of cos(x) 


Float32 


E ROM dicol isf 


9*256 


1st ISF quantizer of the 1st stage 


Float32 


E ROM dicol isf noise 


2*64 


1 St ISF quantizer for comfort noise 


Float32 


E ROM dico21 isf 


3*64 


1st ISF quantizer of the 2nd stage (not the 6.60 kbit/s mode) 


Float32 


E ROM dico21 isf 36b 


5*128 


1st ISF quantizer of the 2nd stage (the 6.60 kbit/s mode) 


Float32 


E ROM dico22 isf 


3*128 


2nd ISF quantizer of the 2nd stage (not the 6.60 kbit/s mode) 


Float32 


E ROM dico22 isf 36b 


4*128 


2nd ISF quantizer of the 2nd stage (the 6.60 kbit/s mode) 


Float32 


E ROM dico23 isf 


3*128 


3rd ISF quantizer of the 2nd stage (not the 6.60 kbit/s mode) 


Float32 


E ROM dico23 isf 36b 


7*64 


3rd ISF quantizer of the 2nd stage (the 6.60 kbit/s mode) 


Float32 


E ROM dico24 isf 


3*32 


4th ISF quantizer of the 2nd stage (not the 6.60 kbit/s mode) 


Float32 


E ROM dico25 isf 


4*32 


5th ISF quantizer of the 2nd stage (not the 6.60 kbit/s mode) 


Float32 


E ROM dico2 isf 


7*256 


2nd ISF quantizer of the 1 st stage 


Float32 


E ROM dico2 isf noise 


3*64 


2nd ISF quantizer for comfort noise 


Float32 


E ROM dico3 isf noise 


3*64 


3rd LSF quantizer for comfort noise 


Float32 


E ROM dico4 isf noise 


4*32 


4th LSF quantizer for comfort noise 


Float32 


E ROM dico5 isf noise 


4*32 


5th LSF quantizer for comfort noise 


Float32 


E ROM en adjust 


9 


Energy scaling factor for each mode during comfort noise 


Float32 


E_ROM_f_interpol_frac 


4 


LPC interpolation coefficients 


Float32 


E ROM fir 6k 7k 


31 


Bandpass FIR filter coefficients for higher band generation 


Word 16 


E ROM fir down 


120 


Downsample FIR filter coefficients 


Float32 


E_ROM_fir_ipol 


61 


Interpol FIR filter coefficients 


Word 16 


E_ROM_fir_up 


120 


Upsample FIR filter coefficients 


Float32 


E_ROM_grid 


101 


Chebyshev polynomial grid points 


Float32 


E_ROM_hamming_cos 


384 


LP analysis window 


Float32 


E ROM hp gain 


16 


High band gain table for 23.85 kbit/s mode 


Float32 


E ROM inter4 1 


4*2*4 


Interpolation filter coefficients 


Word 16 


E ROM inter4 2 


4*2*16 


Interpolation filter coefficients 


Word 16 


E_ROM_interpol_frac 


4 


Interpolation filter coefficients 


Float32 


E ROM isf 


16 


ISF table for initialization 


Word 16 


E_ROM_isp 


16 


ISP table for initialization 


Word 16 


E_ROM_isqrt 


49 


Table used in inverse square root computation 


Float32 


E ROM lag window 


16 


Lag window table 


Word 16 


E ROM log2 


33 


Table used in logarithm computation 


Float32 


E ROM f mean isf 


16 


ISF mean 


Word 16 


E ROM mean isf 


16 


ISF mean 


Float32 


E ROM mean isf noise 


16 


ISF mean for comfort noise 


Word 16 


E_ROM_pdown_unusable 


7 


Attenuation factors for adaptive codebook gain in lost frames 


Word 16 


E_ROM_pdown_usable 


7 


Attenuation factors for adaptive codebook gain in bad frames 


Word 16 


E ROM pow2 


33 


Table used in power of two computation 


Float32 


E_R0M_qua_gain6b 


2*64 


Gain quantization table for 6-bit gain quantization 


Float32 


E ROM qua gain7b 


2*128 


Gain quantization table for 7-bit gain quantization 


UwordS 


E_ROM_tipos 


36 


Starting point for codebook search 
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Table 4: Decoder fixed tables 



Format 


Table name 


Size 


Description 


Word 16 


D ROM cdown unusable 


7 


Attenuation factors for codebook gain in lost frames 


Word 16 


D ROM cdown usable 


7 


Attenuation factors for codebook gain in bad frames 


Word 16 


D ROM cos 


129 


Table of cos(x) 


Word 16 


D ROM dicol isf 


9*256 


1 st ISF quantizer of the 1 st stage 


Word 16 


D ROM died isf noise 


2*64 


1 st ISF quantizer for comfort noise 


Word 16 


D ROM dico21 isf 


3*64 


1st ISF quantizer of the 2nd stage (not the 6.60 kbit/s mode) 


Word 16 


D ROM dico21 isf 36b 


5*128 


1st ISF quantizer of the 2nd stage (the 6.60 kbit/s mode) 


Word 16 


D ROM dico22 isf 


3*128 


2nd ISF quantizer of the 2nd stage (not the 6.60 kbit/s mode) 


Word 16 


D ROM dico22 isf 36b 


4*128 


2nd ISF quantizer of the 2nd stage (the 6.60 kbit/s mode) 


Word 16 


D ROM dico23 isf 


3*128 


3rd ISF quantizer of the 2nd stage (not the 6.60 kbit/s mode) 


Word 16 


D ROM dico23 isf 36b 


7*64 


3rd ISF quantizer of the 2nd stage (the 6.60 kbit/s mode) 


Word 16 


D ROM dico24 isf 


3*32 


4th ISF quantizer of the 2nd stage (not the 6.60 kbit/s mode) 


Word 16 


D ROM dico25 isf 


5*32 


5th ISF quantizer of the 2nd stage (not the 6.60 kbit/s mode) 


Word 16 


D ROM dico2 isf 


7*256 


2nd ISF quantizer of the 1 st stage 


Word 16 


D ROM dico2 isf noise 


3*64 


2nd ISF quantizer for comfort noise 


Word 16 


D ROM dico3 isf noise 


3*64 


3rd LSF quantizer for comfort noise 


Word 16 


D ROM dico4 isf noise 


4*32 


4th LSF quantizer for comfort noise 


Word 16 


D ROM dico5 isf noise 


4*32 


5th LSF quantizer for comfort noise 


Word 16 


D ROM fir 6k 7k 


31 


Bandpass FIR filter coefficients for higher band generation 


Word 16 


D ROM fir 7k 


31 


Bandpass FIR filter coefficients for higher band in 23.85 kbit/s mode 


Word 16 


D ROM fir down 


120 


Downsample FIR filter coefficients 


Word 16 


D ROM fir up 


120 


Upsample FIR filter coefficients 


Word 16 


D ROM hp gain 


16 


High band gain table for 23.85 kbit/s mode 


Word 16 


D ROM inter4 2 


4*2*16 


Interpolation filter coefficients 


Word 16 


D ROM Interpol frac 


4 


LPC interpolation coefficients 


Word 16 


D ROM isf 


16 


ISF table for initialization 


Word 16 


D ROM isp 


16 


ISP table for initialization 


Word 16 


D ROM isqrt 


49 


Table used in inverse square root computation 


Word 16 


D ROM log2 


33 


Table used in logarithm computation 


Word 16 


D ROM mean isf 


16 


ISF mean 


Word 16 


D ROM mean isf noise 


16 


ISF mean for comfort noise 


Word 16 


D ROM pdown unusable 


7 


Attenuation factors for adaptive codebook gain in lost frames 


Word 16 


D ROM pdown usable 


7 


Attenuation factors for adaptive codebook gain in bad frames 


Word 16 


D ROM ph imp low 


64 


Phase dispersion impulse response 


Word 16 


D ROM ph imp mid 


64 


Phase dispersion impulse response 


Word 16 


D ROM pow2 


33 


Table used in power of two computation 


Word 16 


D ROM qua gainGb 


2*64 


Gain quantization table for 6-bit gain quantization 


Word 16 


D ROM qua gain7b 


2*128 


Gain quantization table for 7-bit gain quantization 



4.4.2 Static variables used in tine C-code 

In this clause two tables that specify the static variables for the speech encoder and decoder respectively are shown. All 
static variables are declared within a C struct. 
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Table 5: Speech encoder static variables 



Struct name 


Variable 


Type 


Length 


Description 


Coder_State 


mem_speech 


Float32 


384 


speech buffer 




mem_wsp 


Float32 


371 


buffer holding spectral weighted speech 




mem_hp_wsp 


Float32 


243 


highpass wsp 




mem_decim 


Float32 


30 


Open-loop LTP decimation filter memory 
Estimated BP filter memory (23.85 kbit/s 




mem_hf 


Float32 


30 


mode) 
Input BP filter memory (23.85 kbit/s 




mem_hf2 


Float32 


30 


mode) 
Input LP filter memory (23.85 kbit/s 




mem_hf3 


Float32 


30 


mode) 




memjsp 


Float32 


16 


Old ISP vector 




mem_syn 


Float32 


16 


synthesis filter memory 




mem_syn2 


Float32 


16 


modified synthesis memory 




mem_syn_hf 


Float32 


16 


Higher band synthesis filter memory 




memjsf 


Float32 


16 


Old ISF vector 




mem_hf_wsp 


Float32 


9 


Open-loop lag gain filter memory 




mem_sig_in 


Float32 


4 


Prefilter memory 




mem_sig_out 


Float32 


4 


HP filter memory in the synthesis 




mem_hp400 


Float32 


4 


HP filter memory 




mem_decim2 


Float32 


3 


Open-loop LTP decimation filter memory 




mem_gp_clip 


Float32 


2 


Memory of pitch clipping 




mem_preemph 


Float32 




Preemphasis filter memory 




mem_deemph 


Float32 




Deemphasis filter memory 
Open-loop LTP deemphasis filter 




mem_wsp_df 


Float32 




memory 
Weighting filter memory (applied to error 




mem_wO 


Float32 




signal) 




mem_ol_gain 


Float32 




Open-loop gain 
Weighting level depeding on open loop 




mem_ada_w 


Float32 




pitch gain 




mem_gc_threshold 


Float32 




Noise enhancer threshold 
Higher band gain weighting factor (23.85 




mem_gain_alpha 


Float32 




kbit/s mode) 




mem_ol_lag 


Word32 


5 


Open loop lag history 




mem_TO_med 


Word32 




Weighted open loop pitch lag 




mem_exc 


Word 16 


505 


Excitation vector 




mem_isp_q 


Word 16 


16 


Old ISP vector 




mem_isf_q 


Word 16 


16 


Past quantized ISF prediction error 




mem_gain_q 


Word 16 


4 


Gain quantization memory 




mem_subfr_q 


Word 16 


4 


Scaling factor history 




mem_tilt_code 


Word 16 




Preemhasis filter memory 




memq 


Word 16 




Old scaling factor 




mem seed 


Word 16 




Random generation seed 




*vadSt 


E DTX Vad State 




See below in this table 




*dtx_encSt 


E DTX State 




See below in this table 




mem_first_frame 


UWordS 




First frame indicator 




mem_ol_wght_flg 


UWordS 




Switches lag weighting on and off 




mem vad hist 


UWordS 




VAD history 


E_DTX_State 


memjsf 


Float32 


128 


LSP history 




mem_distance 


Float32 


28 


ISF history distance matrix 




mem_distance_sum 


Float32 


8 


Sum of ISF history distances 




mem_log_en 


Float32 


8 


Logarithmic frame energy history 




mem_hist_ptr 


Word 16 


1 


Pointer to the cyclic history vectors 




mem_log_en_index 


Word 16 


1 


Index for logarithmic energy 




mem_cng_seed 


Word 16 


1 


Comfort noise excitation seed 




mem_dtx_hangover_count 


Word 16 


1 


DTX hangover period 
Counter for elapsed speech frames in 




mem_dec_ana_elapsed_count 


Word 16 


1 


DTX 


E_DTX_Vad_State 


mem_pow_sum 


Float64 


1 


Power of previous frame 




mem_bckr_est 


Float32 


12 


Background noise estimate 
Averaged input components for 




mem_ave_level 


Float32 


12 


stationary estimation 




memjeve 


Float32 


12 


Input levels of the previous frame 
Input levels calculated at the end of a 




memsubjevel 


Float32 


12 


frame (lookahead) 
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Struct name 


Variable 


Type 


Length 


Description 




mem_a_data5 


Float32 


10 


Memory for the filter bank 




mem_a_data3 


Float32 


6 


Memory for the filter bank 




mem_sp_max 


Float32 




Maximum level 




mem_speech_level 


Float32 




Estimated speech level 




mem_burst_count 


Word 16 




Counts length of a speech burst 




mem_hang_count 


Word 16 




Hangover counter 




mem_stat_count 


Word 16 




Stationary counter 




mem_vadreg 


Word 16 




Flags for intermediate VAD decisions 




mem_pitch_tone 


Word 16 




Flags for pitch and tone detection 




mem_sp_est_cnt 


Word 16 




Counter for speech level estimation 




mem sp max cnt 


Word 16 




Counts frames that contains speech 
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Table 6: Speech decoder static variables 



Struct name 


Variable 


Type 


Length 


Description 


DecoderState 


mem_gc_thres 


Word32 


1 


Threshold for noise enhancer 




mem_exc 


Word 16 


505 


INTERPOL]; /* old excitation vector 




mem_isf_buf 


Word 16 


48 


ISF buffer(frequency domain) 




mem_hf 


Word 16 


30 


HF band-pass filter memory 




mem_hf2 


Word 16 


30 


HF band-pass filter memory 




mem_hf3 


Word 16 


30 


HF band-pass filter memory 




memoversamp 


Word 16 


24 


Synthesis oversampled filter memory 




mem_gain 


Word 16 


23 


Gain decoder memory 




mem_syn_hf 


Word 16 


20 


HF synthesis memory 




memjsp 


Word 16 


16 


Old ISP (immittance spectral pairs) 




memjsf 


Word 16 


16 


Old ISF (frequency domain) 




mem_isf_q 


Word 16 


16 


Past ISF quantizer 




mem_syn_hi 


Word 16 


16 


IVIodified synthesis memory (MSB) 




mem_syn_lo 


Word 16 


16 


Modified synthesis memory (LSB) 




mem_ph_disp 


Word 16 


S 


Phase dispersion memory 




mem_sig_out 


Word 16 


6 


Hp50 filter memory for synthesis 




mem_hp400 


Word 16 


6 


Hp400 filter memory for synthesis 




memjag 


Word 16 


5 


LTP lag history 




mem_subfr_q 


Word 16 


4 


Old maximum scaling factor 




mem_tilt_code 


Word 16 




Tilt of code 




memq 


Word 16 




Old scaling factor 




mem_deemph 


Word 16 




Speech deemph filter memory 




mem_seed 


Word 16 




Random memory for frame erasure 




mem_seed2 


Word 16 




Random memory for HF generation 




mem_seed3 


Word 16 




Random memory for lag concealment 




mem_TO 


Word 16 




Old pitch lag 




mem_TO_frac 


Word 16 




Old pitch fraction lag 




mem vad hist 


UWordie 




VAD history 




dtx_decSt 


D DTX State 




See below in this table 




mem_bfi 


UWordS 




Previous BFI 




mem_state 


UWordS 




BGH state machine memory 




mem first frame 


UWordS 




First frame indicator 


dtxdecState 


mem_isf_buf 


Word 16 


12S 


ISF vector history (S frames) 




memjsf 


Word 16 


16 


ISF vector 




mem_isf_prev 


Word 16 


16 


Previous ISF vector 




mem_log_en_buf 


Word 16 


S 


Logarithmic frame energy history 




mem_true_sid_period_inv 


Word 16 




Inverse of true SID update rate 




mem_log_en 


Word 16 




Logarithmic frame energy 




mem_log_en_prev 


Word 16 




Previous logarithmic frame energy 




mem_cng_seed 


Word 16 




Comfort noise excitation seed 




mem_hist_ptr 


Word 16 




Index to beginning of LSF history 




mem_dither_seed 


Word 16 




Comfort noise dithering seed 




mem_cn_dith 


Word 16 




Background noise stationarity information 




mem_dec_ana_elapsed_count 


UWordS 




Counts elapsed speech frames after DTX 




mem_dtx_global_state 


UWordS 




DTX state flags 




mem_since_last_sid 


UWordS 




Number of frames since last SID frame 




mem_data_updated 


UWordS 




Flags CNI updates 




mem_dtx_hangover_count 


UWordS 




Counts down in hangover period 




mem_sid_frame 


UWordS 




Flags SID frames 




mem_valid_data 


UWordS 




Flags SID frames containing valid data 




mem_dtx_hangover_added 


UWordS 




Flags hangover period at end of speech 



Homing procedure 



The principles of the homing procedures are described in [2]. The present document only includes a description of the ' 
decoder homing frames. For each AMR-WB codec mode, the corresponding decoder homing frame has a fixed set of 
speech parameters. Table 7 shows the homing frame speech parameters for different modes. 
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Table 7: Table values for the decoder homing frame parameters for different modes 



Mode 



Speech Parameters 



0, 49, 1 31 , 84, 5, 50, 29, 201 5, 8,0, 2061 , 8,1 , 3560, 8,0, 2981 , 8 

0,49, 131,55,49,38,26,29,29,3, 15,7,15,8, 16,13,7, 17,16,8,0,16,20, 16,27,8,23,0,27,0,27,8 

0,49, 131,55,49,38,26,29,58, 1,7,63,127, 15,70,37, 1,209,210,224,96,31,7, 1,256,260,271, 

443, 31 , 47, 0, 400, 238, 436, 347, 31 

0,49, 131,55,49,38,26,29,58, 1,3847,3845,63, 127,70,34,0,3128,4517, 192,96,0,2, 1,4160, 

8036, 267, 443, 31 , 46, 0, 3840, 7091 , 432, 395, 31 

0, 49, 1 31 , 55, 49, 38, 26, 29, 58, 1 , 3847, 3845, 3847, 3843, 70, 31 , 0, 3648, 4764, 824, 2864, 0,6,1, 

41 60, 5220, 431 9, 71 31 , 31 , 47, 0, 1 1 2, 3764, 219,211,31 

0, 49, 131, 55, 49, 38, 26, 29, 58, 1, 3, 2, 3, 2, 7223, 703, 7223, 703, 70, 0, 1, 3, 2, 2, 3, 9475, 9483, 3090, 

8737, 0, 0, 1 , 0, 0,2,0,411 2, 4400, 841 5, 1 4047, 31 , 38, 0, 2, 1 , 3, 1 , 91 , 426, 1 3545, 1 2955, 

0, 49, 131, 55, 49, 38, 26, 29, 58, 1, 161, 759, 3, 2, 127, 516, 6167, 447, 70, 11, 1, 264, 641, 2, 3, 123, 562, 

8347, 4354, 0, 1, 1, 264, 408, 3, 0, 256, 308, 9487, 14047, 31, 46, 0, 320, 885, 2, 2, 464, 439, 11347, 

12739,0 

0, 49, 1 31 , 55, 49, 38, 26, 29, 58, 1 , 1 1 54, 1 729, 1 1 54, 1 761 , 447, 1 51 9, 959, 495, 70, 27, 1 , 1 800, 1 253, 

665, 1960, 546, 164, 1043, 335, 0, 28, 1, 580, 196, 1187, 383, 1031, 1052, 359, 1531, 31, 45, 1, 1024, 893, 

1 272, 1 920, 101, 876, 203, 1119,31 

0,49, 131,55,49,38,26,29,58, 1, 1729, 1154, 1761, 1154, 1519,959,495,447,70,3,42, 1,580, 1436, 

1362, 1250, 901, 714, 24, 45, 0, 0, 0, 1, 68, 708, 1212, 383, 1048, 1611, 1756, 1467, 31, 1, 23, 0, 1536, 

1460, 861, 1554, 410, 1368, 1008, 594, 31, 



6 File formats 

This clause describes the file formats used by the encoder and decoder programs. The test sequences defined in [1 also 
use the file formats described here. 

6.1 Speech file (encoder input/decoder output) 

Speech files read by the encoder and written by the decoder consist of 16-bit words where each word contains a 14-bit, 
left aligned speech sample. The byte order depends on the host architecture (e.g. MSByte first on SUN workstations, 
LSByte first on PCs etc.). Both the encoder and the decoder program process complete frames (of 320 samples) only. 

This means that the encoder will only process n frames if the length of the input file is n'*320 + k words, while the files 
produced by the decoder will always have a length of n*320 words. 

6.2 Mode control file (encoder input) 

The encoder program can optionally read in a mode control file which specifies the encoding mode for each frame of 
speech processed. The file is a text file containing one number per speech frame. Each line contains one of the mode 
numbers 0-8. 

6.3 Parameter bitstream file (encoder output/decoder input) 

The files produced by the speech encoder/expected by the speech decoder are described in TS26.201 that defines an 
octet-aligned frame format (Interface format 2) for the AMR-WB codec. 
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